Noise-Robust environmental sound classification method based on combination of ICA and MP features

Authors

  • Reona Mogi
  • Hiroyuki Kasai
Abstract

This paper presents an environmental sound classification method that is robust to noise in sounds recorded by mobile devices, and evaluates its performance. The method is designed specifically to recognize higher-level context semantics from environmental sound. Conventionally, sound classification has used frequency-domain acoustic features extracted from sound data with signal processing techniques. The most popular such feature is Mel-frequency Cepstral Coefficients (MFCC), but MFCC is ill-suited to sound mixed with noise. Independent Component Analysis (ICA) can extract sound characteristics even when the source is corrupted by noise, because the components within the source are assumed to be independent. In recent years, Matching Pursuit (MP) has been used to extract time-domain features and has been applied in a variety of applications. MP features are effective for recognizing and classifying environmental sounds that include time-variant sounds such as birdsong, alarms, and vehicle sounds. Several innovative techniques have thus been proposed for recognizing and classifying environmental sounds recorded on mobile devices. However, no decisive method has yet been found that attains a high recognition and classification rate for environmental sounds containing various kinds of noise, such as unintended sounds and white noise. To address this problem, we propose a noise-robust classification method that combines ICA and MP, which makes it possible to reduce the effect of noise on feature extraction. In performance evaluations, we confirmed that the proposed method provides about 8% better classification than MFCC feature extraction.
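As a rough illustration of the Matching Pursuit idea mentioned in the abstract, the following minimal sketch greedily decomposes a toy signal over a small dictionary of unit-norm sinusoids. It is not the authors' implementation: the dictionary, the signal, and the names `matching_pursuit`, `atoms`, and `picks` are all assumptions made for this example, and the paper's actual dictionary and feature definition are not given in the abstract.

```python
import math


def dot(u, v):
    """Inner product of two equal-length sequences."""
    return sum(a * b for a, b in zip(u, v))


def normalize(v):
    """Scale a vector to unit Euclidean norm."""
    norm = math.sqrt(dot(v, v))
    return [a / norm for a in v]


def matching_pursuit(signal, atoms, n_iter):
    """Greedy MP: at each step pick the atom most correlated with the
    current residual, record its coefficient, and subtract its contribution."""
    residual = list(signal)
    picks = []  # (atom index, coefficient) pairs, in selection order
    for _ in range(n_iter):
        scores = [dot(residual, atom) for atom in atoms]
        best = max(range(len(atoms)), key=lambda k: abs(scores[k]))
        coef = scores[best]
        residual = [r - coef * a for r, a in zip(residual, atoms[best])]
        picks.append((best, coef))
    return picks, residual


# Hypothetical toy dictionary: unit-norm sinusoids at a few frequencies.
N = 64
freqs = [1, 2, 4, 8]
atoms = [
    normalize([math.sin(2 * math.pi * f * n / N) for n in range(N)])
    for f in freqs
]

# Toy signal: a strong component on atom 2 plus a weaker one on atom 0.
signal = [3.0 * atoms[2][n] + 1.0 * atoms[0][n] for n in range(N)]

picks, residual = matching_pursuit(signal, atoms, n_iter=2)
for index, coef in picks:
    print(f"atom {index} (freq {freqs[index]}): coefficient {coef:.3f}")
```

In a classification setting, the indices and coefficients of the selected atoms could be summarized into a feature vector; how the paper does this, and how it combines the result with ICA, is not specified in the abstract.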

Similar resources

A Novel Noise-Robust Texture Classification Method Using Joint Multiscale LBP

In this paper we describe a novel noise-robust texture classification method using joint multiscale local binary pattern. The first step in texture classification is to describe the texture by extracting different features. So far, several methods have been developed for this topic; one of the most popular is the Local Binary Pattern (LBP) method and its variants such as Completed Local Binary...

An Improvement in Support Vector Machines Algorithm with Imperialism Competitive Algorithm for Text Documents Classification

Due to the exponential growth of electronic texts, organizing and managing them requires a tool that delivers the information users search for in the shortest possible time. Thus, classification methods have become very important in recent years. In natural language processing and especially text processing, one of the most basic tasks is automatic text classification. Moreover, text ...

Speech enhancement based on hidden Markov model using sparse code shrinkage

This paper presents a new hidden Markov model-based (HMM-based) speech enhancement framework based on independent component analysis (ICA). We propose analytical procedures for training clean speech and noise models by the Baum re-estimation algorithm and present a maximum a posteriori (MAP) estimator based on a Laplace-Gaussian (for clean speech and noise respectively) combination in the HMM ...

Environmental Sound Recognition With Time-Frequency Audio Features

The paper considers the task of recognizing environmental sounds for the understanding of a scene or context surrounding an audio sensor. A variety of features have been proposed for audio recognition, including the popular Mel-frequency cepstral coefficients (MFCCs) which describe the audio spectral shape. Environmental sounds, such as chirpings of insects and sounds of rain which are typicall...

Automatic classification of normal and abnormal cardiac sounds by combining features based on wavelet transform and cepstral coefficients extracted from PCG signals (Research Article)

Cardiac sounds are produced by the mechanical activities of the heart and provide useful information about the function of the heart valves. Due to the transient and unstable nature of the heart's sound and the limitation of the human hearing system, it is difficult to categorize heart sound signals based on what is heard from a stethoscope. Therefore, providing an automated algorithm for prima...

Journal title:
  • Artif. Intell. Research

Volume 2  Issue 

Pages  -

Publication date 2013